
Abstract

We investigate the convergence rate in the Lyapunov theorem when the third absolute moments exist. By means of convex analysis we obtain a sharp estimate for the distance in the mean metric between a probability distribution and its zero bias transformation. This bound allows us to derive new estimates of the convergence rate in terms of Kolmogorov's metric as well as the metrics $\zeta_r$ ($r = 1, 2, 3$) introduced by Zolotarev. The estimate for $\zeta_3$ is optimal. Moreover, we show that the constant in the classical Berry-Esseen theorem can be taken as 0.4785. In addition, the non-i.i.d. analogue of this theorem with the constant 0.5606 is provided.

Our results [1] concerning the convergence rate in the Lyapunov central limit theorem were published in "Doklady Akademii Nauk" (the article was presented by Professor Yu. V. Prokhorov on June 10, 2009). The complete proofs [2] were submitted to the "Theory of Probability and its Applications" on June 8, 2009. As it turned out later, Professor Goldstein independently obtained some results that coincide with ours. Namely, an estimate for the proximity in the mean metric between a probability distribution and its zero bias transformation, and the upper bound of the constant in the mean central limit theorem have been established. His article [3] appeared on arXiv more than two weeks later, i.e. on June 28, 2009.

The present paper includes not only the results of [1], [2], but also their improvements. We show that the constant $C$ in the Berry-Esseen inequality does not exceed 0.4785. Moreover, we find a bound for the constant that appears in the generalization of this theorem to the case of nonidentically distributed summands. For this case we obtain the estimate $C \le 0.5606$. These new results [4] were presented by Professor A. V. Bulinski to the "Russian Mathematical Surveys" on November 17, 2009.
1 Introduction

Consider centered independent (real-valued) random variables (r.v.) $X_1, \ldots, X_n$ with variances $\sigma_1^2, \ldots, \sigma_n^2$ and finite third absolute moments $\beta_1, \ldots, \beta_n$. We denote

$$\sigma^2(n) := \sum_{i=1}^{n} \sigma_i^2, \qquad \varepsilon_n := \frac{1}{\sigma^3(n)} \sum_{j=1}^{n} \beta_j.$$

According to the Lyapunov theorem, $S_n := (X_1 + \ldots + X_n)/\sigma(n)$ converges in distribution to the standard normal r.v. $N$ when $\varepsilon_n \to 0$. From both theoretical and practical points of view, it is very important to estimate the convergence rate in this theorem. It is known [5], [6] that there exists a minimal numerical constant $C$ such that the Kolmogorov distance between $S_n$ and the standard normal variable satisfies the inequality

$$\rho(S_n, N) := \sup_x |P(S_n \le x) - P(N \le x)| \le C\varepsilon_n, \quad n \in \mathbb{N}. \tag{1}$$

There are plenty of works devoted to estimation of this constant. Esseen [6] showed that $C \le 7.5$. Bergström [7] obtained the bound $C \le 4.8$. Takano [8] established that in the case of independent identically distributed (i.i.d.) summands $C \le 2.031$. Zolotarev [9], [10], [11], [12] obtained a new inequality that allows one to estimate the proximity of two sums of independent r.v. With the help of this inequality he showed successively that $C \le 1.322$ and $C \le 0.9051$, while in the case of i.i.d. variables $C \le 1.301$ and $C \le 0.8197$. The proposed method was further developed in the works of van Beek [13] and Shiganov [14], who proved the estimates $C \le 0.7975$ and $C \le 0.7915$, respectively. For sums of identically distributed r.v. Shiganov obtained the bound $C \le 0.7655$, which was sharpened in 2006 by Shevtsova [15]. She showed that in this case $C \le 0.7056$. In [1], [2] we derived the estimates $C \le 0.6379$ in the general case and $C \le 0.5894$ for identically distributed summands. In the present paper we improve them.

From a private communication with Korolev and Shevtsova we know that they have recently established the convergence rate in the central limit theorem in a variety of senses [16], [17], [18]. In these works only the i.i.d. case was considered and the bound for the constant $C$ is not as sharp as ours. However, interesting estimates of another kind were obtained.

It is worth mentioning the related problem of determining the asymptotically best constants in Lyapunov's theorem. As was shown by Esseen [19], if all the r.v. $X_j$, $j = 1, 2, \ldots$, have the same distribution, then

$$\limsup_{n\to\infty} \frac{\rho(S_n, N)}{\varepsilon_n} \le C_1 := \frac{\sqrt{10}+3}{6\sqrt{2\pi}} = 0.409\ldots, \tag{2}$$

and the constant on the right-hand side of this inequality cannot be lowered (hence the lower bound $C \ge C_1$). This result was elaborated by Rogozin [20], who established that under the same assumptions

$$\limsup_{n\to\infty} \frac{\rho(S_n, \mathcal{N})}{\varepsilon_n} \le C_2 := \frac{1}{\sqrt{2\pi}}, \tag{3}$$

where $\rho(S_n, \mathcal{N}) := \inf_{G \in \mathcal{N}} \rho(S_n, G)$ and $\mathcal{N}$ is the set of all normal r.v.

Chistyakov [21], [22], [23] generalized (2) and (3) to the case of nonidentically distributed summands. He proved that

$$\rho(S_n, N) \le C_1\varepsilon_n + r_1(\varepsilon_n), \qquad \rho(S_n, \mathcal{N}) \le C_2\varepsilon_n + r_2(\varepsilon_n),$$

where $r_1(\varepsilon_n)$, $r_2(\varepsilon_n)$ are $o(\varepsilon_n)$ when $\varepsilon_n \to 0$.

There are also estimates of the convergence rate in the Lyapunov theorem provided that the moments of order $2 + \delta$ exist (see [24], [25]).

Analogues of (1) are known for other probability metrics as well, for example, $\zeta_r$ (where $r = 1, 2, 3$). The latter will be described in detail in section 2. Estimates in terms of these metrics can be obtained in a natural way using Stein's method. The proof of the estimates mentioned above uses, in particular, the so-called zero bias transformation of a probability distribution (see [26]).

For the distance in terms of the metrics $\zeta_r$ ($r = 1, 2, 3$) the following estimates (see [27]) are known:

$$\zeta_1(S_n, N) \le 3\varepsilon_n, \qquad \zeta_2(S_n, N) \le \frac{3\sqrt{2\pi}}{8}\varepsilon_n, \qquad \zeta_3(S_n, N) \le \frac{1}{2}\varepsilon_n. \tag{4}$$

Hoeffding [28] considered the problem of finding the least upper bound of $E f(X_1, \ldots, X_n)$ over the set of all collections of independent simple r.v. satisfying $m$ restrictions of the form $E g_{ij}(X_j) = c_{ij}$, $j = 1, \ldots, n$. More precisely, it was established that in this case one has to consider only r.v. taking at most $m + 1$ values. In the present work the results of [28] are generalized to the case of an arbitrary quasiconvex functional defined on the set of all probability distributions.

The results obtained allowed us to derive an unimprovable estimate for the proximity in the mean metric between a probability distribution and its zero bias transformation. The latter was used to estimate the accuracy of the Gaussian approximation for sums of independent variates. It was established that the values of the constants in (4) can be taken 3 times lower. In addition, our estimate for the metric $\zeta_3$ is optimal. Furthermore, new estimates for the difference between the characteristic functions of the normalized sum and the standard normal r.v. were derived, which allowed us to prove that $C \le 0.5606$ and in the case of i.i.d. summands $C \le 0.4785$.



2 Main notions and results 

Let $(S, d)$ be a metric space and denote by $Q$ the set of all finite signed measures on the Borel $\sigma$-algebra $\mathcal{B}(S)$ with the operations of multiplication by a scalar and addition defined as follows: let $\mu, \mu_1, \mu_2 \in Q$, $c \in \mathbb{R}$, then for each $A \in \mathcal{B}(S)$

$$(c\mu)(A) := c \cdot \mu(A), \qquad (\mu_1 + \mu_2)(A) := \mu_1(A) + \mu_2(A).$$

It is easy to see that $Q$ forms a linear space. The set $D$ of discrete probability distributions that are concentrated on finite sets of points is a convex subset of $Q$. The latter means that $\alpha\mu_1 + (1 - \alpha)\mu_2 \in D$ for arbitrary $\mu_1, \mu_2 \in D$ and $\alpha \in (0, 1)$.

Consider the set of all collections consisting of $n$ independent r.v. $X_1, \ldots, X_n$. Then

$$E f(X_1, \ldots, X_n) = \int f \, dP_{X_1} \ldots dP_{X_n},$$

where $f : \mathbb{R}^n \to \mathbb{R}$ and $P_{X_1}, \ldots, P_{X_n}$ are the distributions of $X_1, \ldots, X_n$. Thus, $E f(X_1, \ldots, X_n)$ can be regarded as a function on the set of measures, which is linear with respect to each of its $n$ arguments.

A function $g : G \to \mathbb{R}$, where $G$ is a convex set, is said to be quasiconvex if for any $x, y \in G$ and $\alpha \in (0, 1)$ we have

$$g(\alpha x + (1 - \alpha)y) \le \max\{g(x), g(y)\}.$$

We assume that on $S$ some real-valued functions $h_1, \ldots, h_m$ are defined. Consider the set

$$K := \{\mu \in D : \langle h_i, \mu \rangle = 0, \ i = 1, \ldots, m\}, \quad \text{where } \langle f, \mu \rangle := \int_S f \, d\mu.$$
It is easy to see that $K$ is convex. Let $K_j$ be the set of measures $\mu \in K$ that are concentrated on at most $j$ points ($j \in \mathbb{N}$).

Theorem 1. For any quasiconvex function $g : K \to \mathbb{R}$, we have

$$\sup_{\mu \in K} g(\mu) = \sup_{\mu \in K_{m+1}} g(\mu).$$

In this expression, we assume that the supremum over the empty set is zero.

Theorem 2. Let $f$ be a nonnegative function on $S$, let $V$ be a linear space with the norm $\|\cdot\|$, and let $A : K \to V$ be a mapping such that

$$A(\alpha\mu + (1 - \alpha)\nu) = \alpha A\mu + (1 - \alpha)A\nu \tag{5}$$

for arbitrary $\mu, \nu \in K$, $\alpha \in (0, 1)$. Then the least value of $\gamma$ such that the inequality

$$\|A\mu\| \le \gamma \langle f, \mu \rangle \tag{6}$$

holds for every measure $\mu \in K$ coincides with the least value of $\gamma$ such that (6) is true for every measure $\mu \in K_{m+1}$.

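Let $W$ be a zero-mean r.v. with variance $\sigma^2 > 0$. A r.v. $W^*$ is said to have the $W$-zero biased distribution if

$$E\,W f(W) = \sigma^2 E f'(W^*) \tag{7}$$

for every differentiable function $f : \mathbb{R} \to \mathbb{R}$ such that the left-hand side of (7) is defined. It is known (see [25]) that $W^*$ exists for every $W$ as described above and has a density

$$p^*(w) = \begin{cases} \sigma^{-2} E\{W \cdot 1\{W > w\}\}, & \text{if } w \ge 0; \\ \sigma^{-2} E\{-W \cdot 1\{W < w\}\}, & \text{if } w < 0. \end{cases} \tag{8}$$

For every function $f \in C^{(r-1)}(\mathbb{R})$, where $r \in \mathbb{N}$, define

$$M_r(f) := \sup_{x \ne y} \left| \frac{f^{(r-1)}(x) - f^{(r-1)}(y)}{x - y} \right|.$$

As usual, $C^{(0)}(\mathbb{R}) := C(\mathbb{R})$. If $f \notin C^{(r-1)}(\mathbb{R})$, we set $M_r(f) = \infty$. Denote

$$\zeta_r(X, Y) := \sup\{|E f(X) - E f(Y)| : f \in \mathcal{F}_r\}, \quad r = 1, 2, \ldots,$$

where $\mathcal{F}_r$ is the set of all real bounded functions with $M_r(f) \le 1$.

Note that $\zeta_1$ has alternative representations. These are the so-called mean metric

$$\kappa_1(X, Y) := \int_{-\infty}^{\infty} |P(X \le x) - P(Y \le x)| \, dx,$$

and the minimal $L_1$-metric

$$l_1(X, Y) := \inf\left\{ E|\tilde{X} - \tilde{Y}| : \mathrm{Law}(\tilde{X}) = \mathrm{Law}(X), \ \mathrm{Law}(\tilde{Y}) = \mathrm{Law}(Y) \right\}.$$

For details see [29, p. 21].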

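Theorem 3. If $W$ is a centered r.v. with unit variance and finite third absolute moment, then

$$\zeta_1(W, W^*) \le \frac{1}{2} E|W|^3 \tag{9}$$

with equality when $W$ has a 2-point distribution.

Corollary 1. Consider a r.v. $S_n^*$ having the $S_n$-zero biased distribution. Then

$$\zeta_1(S_n, S_n^*) \le \frac{1}{2}\varepsilon_n.$$

Theorem 4. The following inequalities are true:

$$\zeta_1(S_n, N) \le 2\zeta_1(S_n, S_n^*) \le \varepsilon_n, \qquad \zeta_2(S_n, N) \le \frac{\sqrt{2\pi}}{4}\zeta_1(S_n, S_n^*) \le \frac{\sqrt{2\pi}}{8}\varepsilon_n, \tag{10}$$

$$\zeta_3(S_n, N) \le \frac{1}{3}\zeta_1(S_n, S_n^*) \le \frac{\varepsilon_n}{6}. \tag{11}$$

The latter double inequality is optimal, namely, for every $\delta > 0$ there exists a sequence of i.i.d. r.v. $X_1, X_2, \ldots$ such that

$$\zeta_3(S_n, N) \ge \left(\frac{1}{6} - \delta\right)\varepsilon_n.$$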
For $\gamma > 0$ and $t \in \mathbb{R}$ we set

$$b(t, \gamma) := \begin{cases} -t^2 + 2\gamma a|t|^3, & \text{if } \gamma|t| < M; \\ -\dfrac{2}{\gamma^2}(1 - \cos \gamma t), & \text{if } M \le \gamma|t| \le 2\pi; \\ 0, & \text{if } \gamma|t| > 2\pi. \end{cases}$$

Here

$$a := \max_{x > 0} \left\{ (\cos x - 1 + x^2/2)/x^3 \right\} \approx 0.099162,$$

and $M$ is the point where this maximum is attained, $M \approx 3.995896$.
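The two numerical constants can be reproduced with a few lines of code. The following is a minimal sketch, not the procedure used in the paper: it assumes that a coarse grid search on $(0, 20]$ followed by a ternary refinement is sufficient (beyond $x = 20$ the ratio is below $0.03$), and the function names `ratio` and `maximize_ratio` are illustrative.

```python
import math


def ratio(x):
    """The function (cos x - 1 + x^2/2) / x^3 whose maximum defines a."""
    return (math.cos(x) - 1.0 + 0.5 * x * x) / x**3


def maximize_ratio(x_max=20.0, grid=20000, iters=200):
    """Return (M, a): the maximizer and the maximum of ratio on (0, x_max]."""
    h = x_max / grid
    # Coarse scan to locate the cell that contains the global maximum.
    k_best = max(range(1, grid + 1), key=lambda k: ratio(k * h))
    lo, hi = (k_best - 1) * h, (k_best + 1) * h
    # Ternary search inside the bracket (the function is unimodal there).
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if ratio(m1) < ratio(m2):
            lo = m1
        else:
            hi = m2
    x = 0.5 * (lo + hi)
    return x, ratio(x)


if __name__ == "__main__":
    M, a = maximize_ratio()
    print(f"M = {M:.6f}, a = {a:.6f}")
```

The printed values can be compared with the constants $a \approx 0.099162$ and $M \approx 3.995896$ quoted in the text.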

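Denote $f_{S_n}(t) := E e^{itS_n}$, $\varphi(t) := \exp(-t^2/2)$, $\delta_n(t) := |f_{S_n}(t) - \varphi(t)|$, $t \in \mathbb{R}$.

Theorem 5. For every $t \in \mathbb{R}$ we have

$$|f_{S_n}(t)| \le f_1(\varepsilon_n, t) := \exp\left(\frac{1}{2} b(t, 2\varepsilon_n)\right), \tag{12}$$

$$\delta_n(t) \le \delta_1(\varepsilon_n, t) := \varepsilon_n \varphi(t) \int_0^{|t|} \frac{s^2}{2} \exp\left(\frac{s^2}{2}\right) ds. \tag{13}$$

Define $\Lambda := \varepsilon_n^{-1/3}/6a$. For all $t \in \mathbb{R}$ the following estimate is true:

$$\delta_n(t) \le \delta_2(\varepsilon_n, t) := \varepsilon_n \varphi(t) \begin{cases} \displaystyle\int_0^{|t|} \frac{s^2}{2} \exp\left(\frac{s^2 \varepsilon_n^{2/3}}{2}\right) ds, & |t| \le \Lambda; \\[2ex] \displaystyle\int_0^{\Lambda} \frac{s^2}{2} \exp\left(\frac{s^2 \varepsilon_n^{2/3}}{2}\right) ds + \frac{1}{\chi} \int_{\Lambda}^{|t|} \frac{s^2}{2} \exp\left(2a\varepsilon_n s^3\right) ds, & |t| > \Lambda, \end{cases} \tag{14}$$

where

$$\chi := \inf_{t > 0} \left\{ \exp\left(-t^2/2 + 2at^3\right) \right\} \approx 0.624489.$$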
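The quantities $\delta_1(\varepsilon, t)$ and $\delta_2(\varepsilon, t)$ can be expressed in terms of the so-called Dawson integral

$$\mathrm{Daw}(t) := \exp(-t^2) \int_0^t \exp(s^2) \, ds,$$

which can be computed by means of several efficient numerical procedures. For example, such a function is available in the GNU Scientific Library (GSL). It is easy to check that

$$\delta_1(\varepsilon, t) = \varepsilon \left( \frac{|t|}{2} - \frac{1}{\sqrt{2}} \mathrm{Daw}\left(\frac{|t|}{\sqrt{2}}\right) \right).$$

Moreover,

$$\delta_2(\varepsilon, t) = \begin{cases} \exp\left(-\dfrac{(1 - \varepsilon^{2/3})t^2}{2}\right) \left( \dfrac{\varepsilon^{1/3}|t|}{2} - \dfrac{1}{\sqrt{2}} \mathrm{Daw}\left(\dfrac{\varepsilon^{1/3}|t|}{\sqrt{2}}\right) \right), & |t| \le \Lambda; \\[2ex] \varphi(t) \left\{ \exp\left(\dfrac{1}{72a^2}\right) \left( \dfrac{1}{12a} - \dfrac{1}{\sqrt{2}} \mathrm{Daw}\left(\dfrac{1}{6a\sqrt{2}}\right) \right) + \dfrac{1}{12a\chi} \exp\left(2a\varepsilon u^3\right) \Big|_{u = \Lambda}^{|t|} \right\}, & |t| > \Lambda. \end{cases}$$

These representations are of great importance, since they allow us to reduce significantly the amount of numerical calculations required for the proof of Theorem 7.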
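The Dawson representation of $\delta_1$ can be checked numerically. The sketch below is only an illustration under stated assumptions: it evaluates $\mathrm{Daw}$ by composite Simpson quadrature instead of a library routine (GSL provides `gsl_sf_dawson` and SciPy provides `scipy.special.dawsn`; neither is used here so that the block stays dependency-free), and the helper names `simpson`, `dawson`, `delta1_direct` and `delta1_dawson` are hypothetical. It compares the integral $\varepsilon\varphi(t)\int_0^{|t|} \frac{s^2}{2} e^{s^2/2} ds$ with the closed form $\varepsilon\left(|t|/2 - \mathrm{Daw}(|t|/\sqrt{2})/\sqrt{2}\right)$.

```python
import math


def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b] with an even number n of cells."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * f(a + k * h)
    return total * h / 3.0


def dawson(t):
    """Daw(t) = exp(-t^2) * integral_0^t exp(s^2) ds, by quadrature."""
    return math.exp(-t * t) * simpson(lambda s: math.exp(s * s), 0.0, t)


def delta1_direct(eps, t):
    """delta_1 evaluated straight from its integral definition."""
    t = abs(t)
    integral = simpson(lambda s: 0.5 * s * s * math.exp(0.5 * s * s), 0.0, t)
    return eps * math.exp(-0.5 * t * t) * integral


def delta1_dawson(eps, t):
    """delta_1 evaluated through the Dawson integral."""
    t = abs(t)
    return eps * (0.5 * t - dawson(t / math.sqrt(2.0)) / math.sqrt(2.0))


if __name__ == "__main__":
    for t in (0.5, 2.0, 4.0):
        print(t, delta1_direct(0.3, t), delta1_dawson(0.3, t))
```

The closed form needs one special-function call per value of $t$, whereas the direct form needs a fresh quadrature, which is the saving the text refers to.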

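In the case of i.i.d. variables the estimates can be slightly improved. Denote $\tau_n := \frac{1}{\sigma^3(n)} \sum_{j=1}^{n} \sigma_j^3$, and let $X_1, X_2, \ldots$ be centered i.i.d. r.v. with unit variances and finite third absolute moments $\beta$. Then $\varepsilon_n = \beta/\sqrt{n}$ and $\tau_n = 1/\sqrt{n}$.

Theorem 6. For the sequence of r.v. as defined above and every $t \in \mathbb{R}$

$$|f_{S_n}(t)| \le f_2(\varepsilon_n, n, t) := \left(1 + \frac{b(t, \varepsilon_n + 1/\sqrt{n})}{n}\right)^{n/2}, \tag{15}$$

$$\delta_n(t) \le \delta_3(\varepsilon_n, n, t) := \varepsilon_n \varphi(t) \int_0^{|t|} \left(1 + \frac{b(s, \varepsilon_n + 1/\sqrt{n})}{n}\right)^{\frac{n-1}{2}} \frac{s^2}{2} \exp\left(\frac{s^2}{2}\right) ds. \tag{16}$$

Let $m \in \mathbb{N}$ and $n \ge m$. Then

$$|f_{S_n}(t)| \le f_3(\varepsilon_n, m, t) := \exp\left(\frac{1}{2} b(t, \varepsilon_n + 1/\sqrt{m})\right), \tag{17}$$

$$\delta_n(t) \le \delta_4(\varepsilon_n, m, t) := \varepsilon_n \varphi(t) \int_0^{|t|} \exp\left(\frac{m-1}{2m} b(s, \varepsilon_n + 1/\sqrt{m}) + \frac{s^2}{2}\right) \frac{s^2}{2} \, ds. \tag{18}$$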

Estimates (12) - (18) allowed us to establish the following result.

Theorem 7. The constant $C$ in inequality (1) does not exceed 0.5606, and in the case of identically distributed summands $C \le 0.4785$.
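As a numerical sanity check of the i.i.d. bound (an illustration only, not part of the proof), one can compute the Kolmogorov distance exactly for normalized sums of Bernoulli variables and compare it with $0.4785\,\varepsilon_n$. The sketch below assumes Bernoulli($p$) summands, for which $\varepsilon_n = (p^2 + q^2)/\sqrt{npq}$ with $q = 1 - p$; the function names are hypothetical.

```python
import math


def normal_cdf(x):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def kolmogorov_distance_binomial(n, p):
    """sup_x |P(S_n <= x) - Phi(x)| for the standardized Binomial(n, p) sum."""
    q = 1.0 - p
    sd = math.sqrt(n * p * q)
    cdf, worst = 0.0, 0.0
    for k in range(n + 1):
        phi = normal_cdf((k - n * p) / sd)
        worst = max(worst, abs(cdf - phi))   # left limit at the jump
        cdf += math.comb(n, k) * p**k * q ** (n - k)
        worst = max(worst, abs(cdf - phi))   # value at the jump
    return worst


def lyapunov_fraction_bernoulli(n, p):
    """epsilon_n = E|X - p|^3 / (sigma^3 sqrt(n)) for Bernoulli(p) summands."""
    q = 1.0 - p
    beta = (p * q**3 + q * p**3) / (p * q) ** 1.5
    return beta / math.sqrt(n)


if __name__ == "__main__":
    for n, p in [(1, 0.5), (10, 0.5), (20, 0.1), (50, 0.3)]:
        rho = kolmogorov_distance_binomial(n, p)
        eps = lyapunov_fraction_bernoulli(n, p)
        print(f"n={n:3d} p={p:.1f} rho={rho:.4f} 0.4785*eps={0.4785 * eps:.4f}")
```

Because the distribution function of the sum is a step function, the supremum is attained at a jump point, so checking the value and the left limit at each atom is sufficient.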

3 Proofs 

Proof of Theorem 1. If $K = \emptyset$, then $K_{m+1} = \emptyset$, and the statement of our theorem is true. Further we suppose that the set $K$ is nonempty.

The sequence of the sets $K_1, K_2, \ldots$ increases to the set $K$. Therefore,

$$\sup_{\mu \in K} g(\mu) = \sup_{j \ge 1} \sup_{\mu \in K_j} g(\mu) = \sup_{j \ge m+1} \sup_{\mu \in K_j} g(\mu).$$

So, it is sufficient to show that

$$\sup_{\mu \in K_{m+1}} g(\mu) \ge \sup_{\mu \in K_{m+2}} g(\mu) \ge \sup_{\mu \in K_{m+3}} g(\mu) \ge \ldots$$

Let us take an arbitrary measure $\mu \in K_j$, where $j > m + 1$, and show that there exists $\mu' \in K_{j-1}$ such that $g(\mu') \ge g(\mu)$.

Let $\mu$ be concentrated at points $s_1, \ldots, s_j \in S$ and

$$\mu(\{s_i\}) = \mu_i \ge 0, \quad i = 1, \ldots, j. \tag{19}$$

The vector $\mu = (\mu_1, \ldots, \mu_j)$ defines a probability distribution, so

$$\mu_1 + \ldots + \mu_j = 1. \tag{20}$$

Moreover, the conditions $\langle h_i, \mu \rangle = 0$, $i = 1, \ldots, m$, hold and therefore

$$\mu_1 \cdot h_i(s_1) + \ldots + \mu_j \cdot h_i(s_j) = 0, \quad i = 1, \ldots, m. \tag{21}$$

Conversely, an arbitrary vector with nonnegative coordinates satisfying the system of linear equations (20) and (21) defines according to (19) an element of the set $K_j$, and if one of its coordinates equals zero, an element of $K_{j-1}$. We have $m + 1$ equations and at least $m + 2$ unknowns, so there exists a nonzero solution $\nu = (\nu_1, \ldots, \nu_j)$ of the corresponding homogeneous system. Since the sum of the coordinates of this vector is equal to zero, but the vector itself is nonzero, it follows that $\nu$ has both positive and negative coordinates. Therefore, there exist the least $\alpha \ge 0$ and the least $\beta \ge 0$ such that one of the coordinates of the vector $\mu_* = \mu - \alpha\nu$ equals zero and some coordinate of $\mu^* = \mu + \beta\nu$ is equal to zero. If $\alpha = 0$, then $\mu \in K_{j-1}$. Otherwise,

$$\mu = \frac{\beta}{\alpha + \beta}\mu_* + \frac{\alpha}{\alpha + \beta}\mu^*,$$

and because of the quasiconvexity $g(\mu) \le \max\{g(\mu_*), g(\mu^*)\}$, where $\mu_*$, $\mu^*$ also denote the distributions defined by the vectors $\mu_*$ and $\mu^*$. Thus, $g(\mu) \le g(\mu_*)$ or $g(\mu) \le g(\mu^*)$. But $\mu_*$ and $\mu^* \in K_{j-1}$. □
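The support-reduction step of this proof is constructive and can be sketched in code. The block below is a minimal illustration in exact rational arithmetic, assuming the measure has strictly more than $m + 1$ atoms; the function names `null_vector` and `split_measure` and the example data are hypothetical. It builds the system (20), (21), finds a nonzero solution $\nu$ of the homogeneous system, and returns $\alpha$, $\beta$, $\mu_* = \mu - \alpha\nu$ and $\mu^* = \mu + \beta\nu$.

```python
from fractions import Fraction


def null_vector(rows):
    """Nonzero rational v with rows @ v = 0 (requires more columns than rank)."""
    a = [[Fraction(x) for x in r] for r in rows]
    n_cols = len(a[0])
    pivots = []
    r = 0
    for c in range(n_cols):
        piv = next((i for i in range(r, len(a)) if a[i][c] != 0), None)
        if piv is None:
            continue
        a[r], a[piv] = a[piv], a[r]
        a[r] = [x / a[r][c] for x in a[r]]
        for i in range(len(a)):
            if i != r and a[i][c] != 0:
                factor = a[i][c]
                a[i] = [x - factor * y for x, y in zip(a[i], a[r])]
        pivots.append(c)
        r += 1
        if r == len(a):
            break
    # Set one free variable to 1 and solve for the pivot variables.
    free = next(c for c in range(n_cols) if c not in pivots)
    v = [Fraction(0)] * n_cols
    v[free] = Fraction(1)
    for row, pc in zip(a, pivots):
        v[pc] = -row[free]
    return v


def split_measure(weights, points, constraints):
    """Return (alpha, beta, mu_lower, mu_upper) as in the proof of Theorem 1."""
    rows = [[1] * len(points)] + [[h(s) for s in points] for h in constraints]
    nu = null_vector(rows)
    mu = [Fraction(w) for w in weights]
    alpha = min(m / v for m, v in zip(mu, nu) if v > 0)
    beta = min(m / -v for m, v in zip(mu, nu) if v < 0)
    lower = [m - alpha * v for m, v in zip(mu, nu)]
    upper = [m + beta * v for m, v in zip(mu, nu)]
    return alpha, beta, lower, upper


if __name__ == "__main__":
    # Zero mean and unit variance (m = 2 constraints) on four atoms.
    pts = [Fraction(-2), Fraction(-1, 2), Fraction(1, 2), Fraction(2)]
    w = [Fraction(1, 10), Fraction(2, 5), Fraction(2, 5), Fraction(1, 10)]
    print(split_measure(w, pts, [lambda s: s, lambda s: s * s - 1]))
```

Both returned weight vectors satisfy the same constraints and have at least one atom fewer, and the original weights are their convex combination with coefficients $\beta/(\alpha+\beta)$ and $\alpha/(\alpha+\beta)$.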
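Proof of Theorem 2. Inequality (6) holds on a set of measures if and only if the supremum of $\|A\mu\| - \gamma\langle f, \mu \rangle$ over this set does not exceed zero. Hence, according to Theorem 1, it is sufficient to prove that for every fixed value of $\gamma$ the function

$$g(\mu) := \|A\mu\| - \gamma\langle f, \mu \rangle$$

is quasiconvex. Let $\alpha, \beta \ge 0$, $\alpha + \beta = 1$. By the properties of the norm

$$g(\alpha\mu + \beta\nu) = \|A(\alpha\mu + \beta\nu)\| - \gamma\langle f, \alpha\mu + \beta\nu \rangle = \|\alpha A\mu + \beta A\nu\| - \gamma\langle f, \alpha\mu \rangle - \gamma\langle f, \beta\nu \rangle \le$$

$$\le \|\alpha A\mu\| + \|\beta A\nu\| - \gamma\langle f, \alpha\mu \rangle - \gamma\langle f, \beta\nu \rangle = \alpha g(\mu) + \beta g(\nu) \le \max\{g(\mu), g(\nu)\}. \ □$$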

Proof of Theorem 3. We begin by showing that without loss of generality we can consider simple r.v. $W$. It is sufficient to establish that for every r.v. $W$ satisfying the conditions of the theorem there exists a sequence $(W_n)_{n \ge 1}$ of simple r.v. with zero means and unit variances such that

$$\zeta_1(W_n, W_n^*) \to \zeta_1(W, W^*) \quad \text{and} \quad E|W_n|^3 \to E|W|^3, \quad n \to \infty. \tag{22}$$

We suppose that the r.v. $W$ is defined on the probability space $(\mathbb{R}, \mathcal{B}(\mathbb{R}), P = P_W)$ and construct a sequence of simple r.v. $(W_n')_{n \ge 1}$ that converges to the r.v. $W$ in the $L_3$ norm. We set

$$W_n := \frac{W_n' - E W_n'}{\sqrt{\mathrm{Var}\, W_n'}}.$$

It is easy to see that $W_n$ converges to $W$ in $L_3$ as well. Therefore, the second condition in (22) is obviously satisfied. It remains to show that the first one also holds. From the triangle inequality for the metric $\zeta_1$ one can easily derive that

$$|\zeta_1(W, W^*) - \zeta_1(W_n, W_n^*)| \le \zeta_1(W, W_n) + \zeta_1(W^*, W_n^*). \tag{23}$$

The first summand on the right-hand side of (23) tends to zero, since

$$\zeta_1(W, W_n) = l_1(W, W_n) \le E|W - W_n| \le \left(E|W - W_n|^3\right)^{1/3}.$$

Let us evaluate the second summand. For the function $f \in \mathcal{F}_1$ we set $F(x) := \int_0^x f(u)\,du$. Then, by (7),

$$E f(W^*) - E f(W_n^*) = E\,W F(W) - E\,W_n F(W_n). \tag{24}$$

The difference of expectations on the left-hand side of (24) does not change if we replace the function $f(x)$ by $f(x) - f(0)$. Therefore, we can assume without loss of generality that $f(0) = 0$. Then $|f(x) - f(y)| \le |x - y|$ yields $|f(x)| \le |x|$ and so $|F(x)| \le |x|^2$, $|xf(x)| \le |x|^2$. According to the finite-increment theorem,

$$W F(W) - W_n F(W_n) = (W - W_n) \cdot \{xF(x)\}'\big|_{x = \xi} = (W - W_n)\{F(\xi) + \xi f(\xi)\},$$

where $\xi$ is a number between $W$ and $W_n$. Moreover,

$$|F(\xi) + \xi f(\xi)| \le 2|\xi|^2 \le 2(|W| + |W_n|)^2.$$

This gives the estimate

$$|E\{W F(W) - W_n F(W_n)\}| \le 2E|W - W_n|(|W| + |W_n|)^2.$$

And finally, Hölder's inequality yields

$$E|W - W_n|(|W| + |W_n|)^2 \le \left(E|W - W_n|^3\right)^{1/3} \left(E(|W| + |W_n|)^3\right)^{2/3}.$$

Obviously, $\left(E|W - W_n|^3\right)^{1/3} \to 0$, since $W_n$ converges to $W$ in $L_3$. Thus, the second summand in (23) tends to zero and (22) is fulfilled.

So, it is sufficient to consider simple r.v. Let $A_1$ be a function that maps the distribution $P_X$ of a r.v. $X$ to its zero-biased distribution $P_X^*$. Moreover, consider a linear operator $A_2$ that maps a signed measure $\nu$ to its cumulative distribution function (c.d.f.) $G_\nu(x) := \nu((-\infty, x])$. It is easy to see that

$$(\alpha P_{W_1} + (1 - \alpha)P_{W_2})^* = \alpha P_{W_1}^* + (1 - \alpha)P_{W_2}^*,$$

hence the mapping $A_2 - A_2 A_1$ satisfies (5). If we set $h_1(x) = x$, $h_2(x) = x^2 - 1$ and apply Theorem 2 to $f(x) = |x|^3$, $A = A_2 - A_2 A_1$, and $V$ the normed space of integrable functions on the real line with the norm

$$\|G\| := \int_{-\infty}^{\infty} |G(x)|\,dx,$$

then the problem reduces to the case of simple r.v. taking at most 3 values. The c.d.f. of a simple r.v. is a staircase function. Using formula (8) one can easily obtain the c.d.f. of $W^*$. Therefore, it is not difficult to find the explicit expression for $\kappa_1(W, W^*)$.
Pic. 1

Let $W$ take exactly two values $-x$ and $y$ with probabilities $p$ and $q$. Then its c.d.f. is piecewise constant and has two steps at the points $-x$ and $y$ that are equal to $p$ and $q$, respectively. Since $W$ is centered, we have $px = qy$, which together with (8) yields that $W^*$ is uniformly distributed on $[-x, y]$. Therefore, on $[-x, y]$ its c.d.f. is linear, and its graph is a segment that connects $(-x, 0)$ and $(y, 1)$. By definition $\kappa_1(W, W^*)$ equals the area of the figure bounded by the distribution functions of these r.v. (in the case considered it is a union of two triangles, see pic. 1, left).

It follows from the conditions $EW = 0$ and $EW^2 = 1$ that $x = \sqrt{q/p}$, $y = \sqrt{p/q}$. Hence

$$E|W|^3 = px^3 + qy^3 = q\sqrt{\frac{q}{p}} + p\sqrt{\frac{p}{q}}. \tag{25}$$

Let us find the area of the figure bounded by the c.d.f. of the r.v. $W$ and $W^*$. The density of $W^*$ is $px = qy = \sqrt{pq}$. Thus, the slope of the c.d.f. of this r.v. on $[-x, y]$ is $\sqrt{pq}$. The length of the vertical leg of the first triangle is $p$, and that of the second one is $q$. Hence, the total area of both triangles is

$$\frac{p^2}{2\sqrt{pq}} + \frac{q^2}{2\sqrt{pq}} = \frac{1}{2}\left(p\sqrt{\frac{p}{q}} + q\sqrt{\frac{q}{p}}\right) = \frac{1}{2}E|W|^3.$$

Therefore, if $W$ takes exactly two values, there is equality in (9).
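The equality in the two-point case, and the inequality (9) for other simple distributions, can be checked numerically. The following is a rough sketch under stated assumptions: the r.v. is discrete and centered, the density of $W^*$ is taken from (8) in the equivalent form $\sigma^{-2}E\{W \cdot 1\{W > w\}\}$ (valid for every non-atom $w$ because $EW = 0$), and the mean metric is approximated by a midpoint rule. The function name and the example distributions are illustrative.

```python
import math


def zero_bias_mean_metric(values, probs, steps=100_000):
    """Approximate kappa_1(W, W*) = integral |F_W - F_W*| for centered discrete W."""
    var = sum(p * v * v for v, p in zip(values, probs))
    lo, hi = min(values), max(values)
    h = (hi - lo) / steps
    f_star = 0.0   # c.d.f. of W* at the left end of the current cell
    total = 0.0
    for k in range(steps):
        x = lo + (k + 0.5) * h   # midpoint of the cell
        dens = sum(p * v for v, p in zip(values, probs) if v > x) / var
        f_w = sum(p for v, p in zip(values, probs) if v <= x)
        total += abs(f_w - (f_star + 0.5 * h * dens)) * h
        f_star += h * dens
    return total


def third_abs_moment(values, probs):
    return sum(p * abs(v) ** 3 for v, p in zip(values, probs))


if __name__ == "__main__":
    p, q = 0.3, 0.7
    two_point = ([-math.sqrt(q / p), math.sqrt(p / q)], [p, q])
    three_point = ([-2.0, -0.5, 1.0], [1 / 9, 4 / 9, 4 / 9])
    for vals, prs in (two_point, three_point):
        print(zero_bias_mean_metric(vals, prs), 0.5 * third_abs_moment(vals, prs))
```

For the two-point law the two printed numbers agree up to discretization error, while for the three-point law (zero mean, unit variance) the first is strictly smaller.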

Consider the case when $W$ takes three values. We assume without loss of generality that two of them ($-a < -b$) do not exceed zero and one ($c$) is positive. As before, the c.d.f. of $W$ is piecewise constant and the c.d.f. of $W^*$ is piecewise linear. However, the form of the figure bounded by them is more complicated (see pic. 1, right). Denote by $R$ the value of the c.d.f. of $W^*$ at $-b$ and by $S$ its value at 0. Let $W$ take the values $-a, -b, c$ with probabilities $p, q, r$, respectively. Then, because of the moment-type restrictions,

$$p + q + r = 1, \qquad -pa - qb + rc = 0, \qquad pa^2 + qb^2 + rc^2 = 1.$$

It is a system of linear equations with respect to $p, q, r$. Using Cramer's rule, we obtain

$$p = (1 - bc)(c + b)/\Delta, \qquad q = (ac - 1)(a + c)/\Delta, \qquad r = (ab + 1)(a - b)/\Delta,$$

where $\Delta = (a + c)(b + c)(a - b)$. Thus, every r.v. with zero mean and variance 1 that takes three values is uniquely determined by these three values. It is easy to see that $p, q, r$ are nonnegative iff

$$ac \ge 1, \qquad bc \le 1. \tag{26}$$

In other words, a r.v. $W$ taking the values $-a, -b, c$ exists iff (26) is satisfied. Our aim is to prove that the function

$$g(a, b, c) := \kappa_1(W, W^*) - \frac{1}{2}E|W|^3 \tag{27}$$

does not exceed zero. Its explicit form in terms of the variables $a, b, c$ depends on how the c.d.f. of the r.v. $W$ and $W^*$ are located with respect to each other. There are 5 cases:

I. $R \le p$, $S \le p + q$, or, equivalently, $a(a - b) \le 1$, $c \ge 1$. In this case

$$g(a, b, c) = \frac{r}{c} - pa^3 - qb^3.$$

II. $R \le p$, $S \ge p + q$ $\Leftrightarrow$ $a(a - b) \le 1$, $c \le 1$.

$$g(a, b, c) = \frac{r}{c} - pa^3 - qb^3.$$

III. $R \ge p$, $S \le p + q$ $\Leftrightarrow$ $a(a - b) \ge 1$, $c \ge 1$ (the latter implies $c(b + c) \ge 1$).

$$g(a, b, c) = pa\left\{a - b - \frac{1}{a}\right\}^2 + \frac{r}{c} - pa^3 - qb^3.$$

IV. $p \le R \le p + q$, $S \ge p + q$ $\Leftrightarrow$ $a(a - b) \ge 1$, $c(b + c) \ge 1$, $c \le 1$.

$$g(a, b, c) = pa\left\{a - b - \frac{1}{a}\right\}^2 + \frac{r}{c} - pa^3 - qb^3.$$

V. $R \ge p + q$ $\Leftrightarrow$ $c(b + c) \le 1$.

$$g(a, b, c) = \frac{p}{a} - rc^3.$$

Note that in each of these cases $g$ is the same function defined by (27). As a result, if the values $a, b, c$ satisfy the restrictions of two cases simultaneously, then for the function $g$ we can use the expression corresponding to any of them.

As one can see, in the cases I, II, and in the cases III, IV the function $g$ has the same representation. Therefore, further we distinguish three possibilities:

A. $a(a - b) \le 1$.

$$g(a, b, c) = \frac{r}{c} - pa^3 - qb^3.$$

B. $a(a - b) \ge 1$, $c(b + c) \ge 1$.

$$g(a, b, c) = pa\left\{a - b - \frac{1}{a}\right\}^2 + \frac{r}{c} - pa^3 - qb^3.$$

C. $c(b + c) \le 1$.

$$g(a, b, c) = \frac{p}{a} - rc^3.$$

We show that in each of the cases A, B and C the function $g$ does not exceed zero.

Case A.

$$g(a, b, c) = \frac{r}{c} - pa^3 - qb^3 = \frac{(1 + ab)(a - b)}{\Delta c} - \frac{(1 - bc)(b + c)a^3}{\Delta} - \frac{(ac - 1)(a + c)b^3}{\Delta} =$$

$$= \frac{(a - b)(ac - 1)(1 - bc)}{\Delta c}\,(-1 - bc - ac - ab) \le 0$$

because of (26), the nonnegativeness of $a, b, c$ and the fact that $a > b$.

Case C.

$$g(a, b, c) = \frac{p}{a} - rc^3 = \frac{(1 - bc)(b + c)}{\Delta a} - \frac{(1 + ab)(a - b)c^3}{\Delta} = \frac{ac - 1}{\Delta a}\left\{-\left[(ab + 1)(a - b)\right]c^2 - \left[1 + b(a - b)\right]c - b\right\}.$$

Since $ac \ge 1$, it suffices to prove that the expression enclosed by braces does not exceed zero. Consider this expression as a function of the variable $c$ while holding the others fixed. When $c = 0$, this function equals $-b \le 0$. Moreover, it decreases with respect to $c$, since the coefficients of the terms $c$ and $c^2$ are negative. Consequently, for all positive values of $c$ it does not exceed zero.

Case B.

$$g(a, b, c) = pa\left\{a - b - \frac{1}{a}\right\}^2 + \frac{r}{c} - pa^3 - qb^3 = \frac{(1 - bc)(b + c)a}{\Delta}\left\{a - b - \frac{1}{a}\right\}^2 +$$

$$+ \frac{(1 + ab)(a - b)}{\Delta c} - \frac{(1 - bc)(b + c)a^3}{\Delta} - \frac{(ac - 1)(a + c)b^3}{\Delta} = \frac{1 - bc}{\Delta ac}\left[k_2 c^2 + k_1 c + k_0\right],$$

where $k_2 = 1 - 2a(a - b)(1 + ab)$, $k_1 = b\left[1 - a(a - b)(1 + ab)\right]$, $k_0 = a(a - b)(1 + ab)$. Assume that $g(a, b, c) > 0$. Due to the condition $a(a - b) \ge 1$ one has

$$1 - a(a - b)(1 + ab) \le 1 - (1 + ab) = -ab \le 0,$$
$$1 - 2a(a - b)(1 + ab) \le 1 - 2(1 + ab) = -1 - 2ab < 0.$$

Therefore, $k_2$ and $k_1$ do not exceed zero and, consequently, $k_2 c^2 + k_1 c + k_0$ decreases with respect to $c$. Consequently, if one reduces the value of the variable $c$ while holding $a$ and $b$ fixed, $g$ will remain positive. The variable $c$ is bounded from below by two conditions:

$$ac \ge 1 \quad \text{and} \quad c(b + c) \ge 1.$$

The first of these conditions can be omitted, since it follows from the other two:

$$c(b + c) \ge 1 \quad \text{and} \quad a(a - b) \ge 1.$$

Indeed, let $ac < 1$. Then

$$1 \le c(b + c) < \frac{1}{a}\left(b + \frac{1}{a}\right) \ \Rightarrow \ a^2 < ab + 1 \ \Rightarrow \ a(a - b) < 1.$$

Therefore, we can reduce $c$ to the value $c^*$ such that $c^*(b + c^*) = 1$, and $g$ will remain positive. But the situation when $c(b + c) = 1$ satisfies the restrictions of the case C, for which we established that $g \le 0$, a contradiction. □
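The three algebraic identities used in cases A, B and C are rational-function identities in $a, b, c$, so they can be spot-checked in exact arithmetic. The sketch below is only a consistency check and assumes the following forms: $g = r/c - pa^3 - qb^3$ equals $-(a-b)(ac-1)(1-bc)(1+ab+ac+bc)/(\Delta c)$ in case A; $g = pa(a-b-1/a)^2 + r/c - pa^3 - qb^3$ equals $(1-bc)(k_2c^2+k_1c+k_0)/(\Delta ac)$ in case B; and $g = p/a - rc^3$ equals $(ac-1)\{-(ab+1)(a-b)c^2 - [1+b(a-b)]c - b\}/(\Delta a)$ in case C. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def weights(a, b, c):
    """Probabilities p, q, r of the values -a, -b, c and the determinant Delta."""
    d = (a + c) * (b + c) * (a - b)
    p = (1 - b * c) * (c + b) / d
    q = (a * c - 1) * (a + c) / d
    r = (a * b + 1) * (a - b) / d
    return p, q, r, d


def case_a(a, b, c):
    p, q, r, d = weights(a, b, c)
    lhs = r / c - p * a**3 - q * b**3
    rhs = -(a - b) * (a * c - 1) * (1 - b * c) * (1 + a * b + a * c + b * c) / (d * c)
    return lhs, rhs


def case_b(a, b, c):
    p, q, r, d = weights(a, b, c)
    u = a * (a - b) * (1 + a * b)
    k2, k1, k0 = 1 - 2 * u, b * (1 - u), u
    lhs = p * a * (a - b - 1 / a) ** 2 + r / c - p * a**3 - q * b**3
    rhs = (1 - b * c) * (k2 * c * c + k1 * c + k0) / (d * a * c)
    return lhs, rhs


def case_c(a, b, c):
    p, q, r, d = weights(a, b, c)
    lhs = p / a - r * c**3
    braces = -(a * b + 1) * (a - b) * c * c - (1 + b * (a - b)) * c - b
    rhs = (a * c - 1) * braces / (d * a)
    return lhs, rhs


if __name__ == "__main__":
    grid_a = [Fraction(3, 2), Fraction(2), Fraction(3)]
    grid_b = [Fraction(0), Fraction(1, 4), Fraction(1, 2)]
    grid_c = [Fraction(1, 2), Fraction(1), Fraction(3, 2)]
    for a, b, c in product(grid_a, grid_b, grid_c):
        for check in (case_a, case_b, case_c):
            lhs, rhs = check(a, b, c)
            assert lhs == rhs
    print("all identities hold on the grid")
```

The identities hold for all admissible rational arguments, independently of which case actually applies to a given triple, so the grid does not need to respect the case conditions.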

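Proof of Corollary 1. Without loss of generality assume $\sigma(n) = 1$. Let $I$ be a random index taking the values $1, \ldots, n$ with probabilities $\sigma_1^2, \ldots, \sigma_n^2$, independent of $X_1, \ldots, X_n$. Construct on an extended probability space

$$S_i' := \sum_{j \ne i} X_j + X_i^*, \quad i = 1, \ldots, n,$$

where $X_i^*$ has the $X_i$-zero biased distribution and is independent of $I, X_1, \ldots, X_n$, $i = 1, \ldots, n$. Then $S_n^*$ has the same distribution as $S_I'$ (see [21]). Therefore, for an arbitrary function $f \in \mathcal{F}_1$ one has

$$E f(S_n^*) = E f(S_I') = \sum_{k=1}^{n} E f(S_I') 1\{I = k\} = \sum_{k=1}^{n} E f(S_k') 1\{I = k\} = \sum_{k=1}^{n} \sigma_k^2 E f(S_k'). \tag{28}$$

Consequently,

$$|E f(S_n) - E f(S_n^*)| = \left| \sum_{k=1}^{n} \sigma_k^2 E f(S_n) - \sum_{k=1}^{n} \sigma_k^2 E f(S_k') \right| \le \sum_{k=1}^{n} \sigma_k^2 |E f(S_n) - E f(S_k')| \le$$

$$\le \sum_{k=1}^{n} \sigma_k^2 \zeta_1(X_k, X_k^*) = \sum_{k=1}^{n} \sigma_k^3 \zeta_1\left(\frac{X_k}{\sigma_k}, \frac{X_k^*}{\sigma_k}\right) \le \sum_{k=1}^{n} \sigma_k^3 \frac{\beta_k}{2\sigma_k^3} = \frac{1}{2}\sum_{k=1}^{n} \beta_k = \frac{\varepsilon_n}{2}.$$

Here we used the statement of Theorem 3 for the r.v. $X_k/\sigma_k$ as well as the property of homogeneity of the metric $\zeta_1$ (i.e. $\zeta_1(cX, cY) = c\zeta_1(X, Y)$ for $c > 0$) and $(aX)^* = aX^*$. □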
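Proof of Theorem 4. It is easy to see that for continuous $f : \mathbb{R} \to \mathbb{R}$ the function

$$h(w) := e^{w^2/2} \int_{-\infty}^{w} \left(f(x) - E f(N)\right) e^{-x^2/2}\,dx$$

satisfies Stein's equation

$$h'(w) - wh(w) = f(w) - E f(N).$$

Hence

$$|E f(S_n) - E f(N)| = |E h'(S_n) - E S_n h(S_n)| = |E h'(S_n) - E h'(S_n^*)| \le M_2(h)\zeta_1(S_n, S_n^*).$$

It is known that

$$M_2(h) \le \min\left\{2M_1(f), \ \frac{\sqrt{2\pi}}{4}M_2(f), \ \frac{1}{3}M_3(f)\right\}.$$

Taking into account that $M_r(f) \le 1$ in the definition of the metric $\zeta_r$, one has

$$\zeta_1(S_n, N) \le 2\zeta_1(S_n, S_n^*), \qquad \zeta_2(S_n, N) \le \frac{\sqrt{2\pi}}{4}\zeta_1(S_n, S_n^*), \qquad \zeta_3(S_n, N) \le \frac{1}{3}\zeta_1(S_n, S_n^*). \tag{29}$$

Estimates in terms of $\varepsilon_n$ are obtained by applying Corollary 1 to (29).

Let us prove the optimality of (11). We set $f(x) := x^3/6$. Then $M_3(f) = 1$ and $E f(N) = 0$, since the r.v. $N$ is symmetric and the function $f$ is odd. Consider $X_1, X_2, \ldots$, a sequence of i.i.d. variables with zero means and unit variances. Then

$$E S_n^3 = \frac{E X_1^3}{\sqrt{n}} \quad \text{and} \quad \varepsilon_n = \frac{E|X_1|^3}{\sqrt{n}}.$$

As a result, we have

$$\frac{\zeta_3(S_n, N)}{\varepsilon_n} \ge \frac{|E f(S_n) - E f(N)|}{\varepsilon_n} = \frac{|E X_1^3|}{6E|X_1|^3}.$$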
It only remains to prove that $|E X_1^3| / E|X_1|^3$ can be arbitrarily close to unity. According to (25), the third absolute moment of a centered r.v. $X_1$ with variance 1 taking two values $\xi = -\sqrt{q/p}$, $\eta = \sqrt{p/q}$ with probabilities $p$ and $q$, respectively, is equal to

$$E|X_1|^3 = q\sqrt{\frac{q}{p}} + p\sqrt{\frac{p}{q}}.$$

It is easy to see that the third moment of this r.v. equals

$$E X_1^3 = p\sqrt{\frac{p}{q}} - q\sqrt{\frac{q}{p}}.$$

Obviously, $|E X_1^3| / E|X_1|^3 \to 1$ when $p \to 1$. □
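The limit can be made explicit. As a short worked computation (using only the two-point moments of a centered r.v. with unit variance taking the values $-\sqrt{q/p}$ and $\sqrt{p/q}$ with probabilities $p$ and $q = 1 - p$), multiplying the numerator and the denominator by $\sqrt{pq}$ gives

$$\frac{|E X_1^3|}{E|X_1|^3} = \frac{\left|p\sqrt{p/q} - q\sqrt{q/p}\right|}{p\sqrt{p/q} + q\sqrt{q/p}} = \frac{|p^2 - q^2|}{p^2 + q^2} = \frac{|2p - 1|}{1 - 2p(1 - p)}.$$

For instance, $p = 0.99$ gives $0.98/0.9802 \approx 0.9998$, and the ratio tends to 1 as $p \to 1$.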

Lemma 1 ([30]). Let $W$ be a centered r.v. with variance 1 and finite third absolute moment $\beta$. Denote $f(t) := E e^{itW}$, $t \in \mathbb{R}$. Then for all $t \in \mathbb{R}$

$$|f(t)|^2 \le 1 + b(t, \beta + 1), \tag{30}$$

moreover,

$$|f_{S_n}(t)|^2 \le \exp\left(b(t, \varepsilon_n + \tau_n)\right). \tag{31}$$

Lemma 2. For every $t \in \mathbb{R}$ the function $b(t, \gamma)$ is nondecreasing with respect to $\gamma$.

Proof. This can be checked directly by calculating the derivative. □
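The monotonicity can also be observed numerically. The sketch below is an illustration, not a proof: it assumes the piecewise form $b(t, \gamma) = -t^2 + 2\gamma a|t|^3$ for $\gamma|t| < M$, $b(t, \gamma) = -2(1 - \cos\gamma t)/\gamma^2$ for $M \le \gamma|t| \le 2\pi$ and $b(t, \gamma) = 0$ otherwise, takes $M = 3.995896$ from the text, and derives $a$ from $M$ so that the two branches match exactly at $\gamma|t| = M$. The names `M_CONST`, `A_CONST` and `b` are illustrative.

```python
import math

M_CONST = 3.995896
# a = (cos M - 1 + M^2/2) / M^3, which makes b continuous at gamma*|t| = M.
A_CONST = (math.cos(M_CONST) - 1.0 + 0.5 * M_CONST**2) / M_CONST**3


def b(t, gamma):
    """Piecewise function b(t, gamma) used in the characteristic function bounds."""
    u = gamma * abs(t)
    if u < M_CONST:
        return -t * t + 2.0 * gamma * A_CONST * abs(t) ** 3
    if u <= 2.0 * math.pi:
        return -2.0 * (1.0 - math.cos(gamma * t)) / gamma**2
    return 0.0


if __name__ == "__main__":
    gammas = [0.01 * k for k in range(1, 2001)]
    for t in (0.5, 1.0, 2.0, 5.0):
        values = [b(t, g) for g in gammas]
        assert all(y - x >= -1e-12 for x, y in zip(values, values[1:]))
        assert all(v <= 1e-12 for v in values)
    print("b(t, gamma) is nondecreasing in gamma on the tested grid")
```

On the middle branch the monotonicity comes from the fact that $(1 - \cos u)/u^2$ decreases on $[M, 2\pi]$, where $\sin u \le 0$.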
Lemma 3. If $W$ is a centered r.v. with variance 1 and $f(t) = E e^{itW}$, then

$$|f(t) - \varphi(t)| \le \varphi(t) \int_0^{|t|} |f(s) - f^*(s)|\, s \exp\left(\frac{s^2}{2}\right) ds, \quad t \in \mathbb{R}, \tag{32}$$

where $f^*(t)$ is the characteristic function of a r.v. $W^*$ having the $W$-zero biased distribution.

Proof. According to the definition of the $W$-zero biased distribution,

$$f'(t) = iE\,W\cos(tW) - E\,W\sin(tW) = -itE\sin(tW^*) - tE\cos(tW^*) = -tE e^{itW^*} = -tf^*(t). \tag{33}$$

Consider the function $\psi(t) := f(t)e^{t^2/2}$. Note that $\psi(0) = 1$. Taking into account (33), we have

$$\psi'(t) = \frac{d}{dt}\left(f(t)e^{t^2/2}\right) = f'(t)e^{t^2/2} + tf(t)e^{t^2/2} = \{f(t) - f^*(t)\}te^{t^2/2}.$$

Then

$$\left|\frac{f(t)}{\varphi(t)} - 1\right| = |\psi(t) - \psi(0)| \le \int_0^{|t|} |\psi'(s)|\,ds = \int_0^{|t|} |f(s) - f^*(s)|\, s e^{s^2/2}\,ds. \ □$$



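Lemma 4. For arbitrary r.v. $X$ and $Y$ we have

$$\left|E e^{itX} - E e^{itY}\right| \le |t|\,\zeta_1(X, Y).$$

Proof. It is well known that for all $t, x, y \in \mathbb{R}$ the inequality

$$\left|e^{itx} - e^{ity}\right| \le |t||x - y|$$

holds. Thus, for arbitrary $\tilde{X}, \tilde{Y}$ defined on one probability space such that $\mathrm{Law}(\tilde{X}) = \mathrm{Law}(X)$ and $\mathrm{Law}(\tilde{Y}) = \mathrm{Law}(Y)$, we have

$$\left|E e^{itX} - E e^{itY}\right| = \left|E e^{it\tilde{X}} - E e^{it\tilde{Y}}\right| \le E\left|e^{it\tilde{X}} - e^{it\tilde{Y}}\right| \le |t|\,E|\tilde{X} - \tilde{Y}|. \tag{34}$$

Passing in (34) to the greatest lower bound over all possible $\tilde{X}, \tilde{Y}$, we obtain

$$\left|E e^{itX} - E e^{itY}\right| \le |t|\,l_1(X, Y) = |t|\,\zeta_1(X, Y). \ □$$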

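Proof of Theorem 5. The inequality (12) is a consequence of Lemma 1. Indeed, according to the Lyapunov inequality, we have $\sigma_j^3 \le \beta_j$, $j = 1, \ldots, n$. Hence $\tau_n \le \varepsilon_n$. Now (12) follows from (31) and Lemma 2.

Further we assume without loss of generality that $\sigma(n) = 1$. Denote $f_j(t) := E e^{itX_j}$, $f_j^*(t) := E e^{itX_j^*}$, $j = 1, \ldots, n$, and set $W := S_n$ in Lemma 3. Using Lemma 4 and Corollary 1 we get, for $s \ge 0$,

$$|f(s) - f^*(s)| \le s\,\zeta_1(S_n, S_n^*) \le \frac{\varepsilon_n s}{2}.$$

Substituting the latter into (32), we arrive at (13).

According to (28),

$$f^*(s) = \sum_{m=1}^{n} \sigma_m^2 E e^{isS_m'} = \sum_{m=1}^{n} \sigma_m^2 f_m^*(s) \prod_{j \ne m} f_j(s). \tag{35}$$

Therefore,

$$|f(s) - f^*(s)| \le \sum_{m=1}^{n} \sigma_m^2 |f_m(s) - f_m^*(s)| \prod_{j \ne m} |f_j(s)|.$$

From Lemma 4 and Theorem 3 we have

$$|f_j(s) - f_j^*(s)| \le s\,\zeta_1(X_j, X_j^*) = s\sigma_j\,\zeta_1\left(\frac{X_j}{\sigma_j}, \frac{X_j^*}{\sigma_j}\right) \le \frac{s\beta_j}{2\sigma_j^2}. \tag{36}$$

It follows from (30) that $|f_j(s)| \le \exp\left(-\sigma_j^2 s^2/2 + 2\beta_j a|s|^3\right)$ for all real $s$. As a result,

$$|f(s) - f^*(s)| \le \sum_{m=1}^{n} \frac{\beta_m s}{2} \exp\left(\sum_{j \ne m} \left(-\frac{\sigma_j^2 s^2}{2} + 2\beta_j a s^3\right)\right) = \sum_{m=1}^{n} \frac{\beta_m s}{2} \exp\left(-\frac{s^2}{2} + 2\varepsilon_n a s^3\right) \exp\left(\frac{\sigma_m^2 s^2}{2} - 2\beta_m a s^3\right). \tag{37}$$

Since $\sigma_j^3 \le \beta_j \le \varepsilon_n$ for $j = 1, \ldots, n$, we have for such $j$ and $t \ge 0$

$$\exp\left(\frac{\sigma_j^2 t^2}{2} - 2\beta_j a t^3\right) \le \exp\left(\frac{\sigma_j^2 t^2}{2} - 2\sigma_j^3 a t^3\right) = \exp\left(\frac{s^2}{2} - 2as^3\right)\bigg|_{s = t\sigma_j} \le \sup\left\{\exp\left(\frac{s^2}{2} - 2as^3\right) : s \in [0, t\varepsilon_n^{1/3}]\right\}. \tag{38}$$

The function $\exp\left(s^2/2 - 2as^3\right)$ increases on the segment $[0, 1/6a]$ and at the point $1/6a$ it attains its global maximum equal to $1/\chi$. Therefore,

$$\sup\left\{\exp\left(\frac{s^2}{2} - 2as^3\right) : s \in [0, t\varepsilon_n^{1/3}]\right\} = \begin{cases} \exp\left(\dfrac{t^2\varepsilon_n^{2/3}}{2} - 2a\varepsilon_n t^3\right), & \text{if } t\varepsilon_n^{1/3} \le 1/6a; \\[1.5ex] 1/\chi & \text{otherwise}. \end{cases} \tag{39}$$

Combining (37), (38) and (39) gives for $s\varepsilon_n^{1/3} \le 1/6a$

$$|f(s) - f^*(s)| \le \frac{\varepsilon_n s}{2} \exp\left(-\frac{s^2}{2}\left(1 - \varepsilon_n^{2/3}\right)\right),$$

and for $s\varepsilon_n^{1/3} > 1/6a$

$$|f(s) - f^*(s)| \le \frac{\varepsilon_n s}{2\chi} \exp\left(-\frac{s^2}{2} + 2a\varepsilon_n s^3\right).$$

Substituting the expressions obtained into (32), we get the required estimates. □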

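Proof of Theorem 6. At first we prove (15). Denote $f_1(t) := E e^{itX_1}$. According to Lemma 1,

$$\left|f_1\left(\frac{t}{\sqrt{n}}\right)\right|^2 \le 1 + b\left(\frac{t}{\sqrt{n}}, \beta + 1\right) = 1 + \frac{1}{n} b(t, \varepsilon_n + \tau_n). \tag{40}$$

Now (15) follows from the fact that $f_{S_n}(t) = f_1^n(t/\sqrt{n})$.

To establish (17) we note that $1 + x \le e^x$ for all real $x$. Applying this inequality to (15) gives

$$|f_{S_n}(t)| \le \exp\left(\frac{1}{2} b(t, \varepsilon_n + \tau_n)\right). \tag{41}$$

It remains to note that the sequence $(\tau_m)_{m \ge 1}$ decreases, which together with Lemma 2 leads to (17).

We set $W := S_n$ in Lemma 3. Applying (35) to the r.v. $X_1/\sqrt{n}, \ldots, X_n/\sqrt{n}$ yields

$$|f(s) - f^*(s)| = \left|f_1\left(\frac{s}{\sqrt{n}}\right)\right|^{n-1} \left|f_1\left(\frac{s}{\sqrt{n}}\right) - f_1^*\left(\frac{s}{\sqrt{n}}\right)\right|.$$

The first factor can be estimated with the help of (40) and the second one by means of (36). We have

$$|f(s) - f^*(s)| \le \left(1 + \frac{b(s, \varepsilon_n + \tau_n)}{n}\right)^{\frac{n-1}{2}} \frac{\varepsilon_n s}{2}. \tag{42}$$

Substituting the expression obtained into (32), we get (16). To establish (18) we apply the inequality $1 + x \le e^x$ to the first factor on the right-hand side of (42) and note that $(\tau_m)_{m \ge 1}$ is decreasing. □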

Proof of Theorem 7. Let $D(\varepsilon, n)$ denote the least quantity such that for every collection consisting of $n$ r.v. $X_1, \ldots, X_n$ with $\varepsilon_n = \varepsilon$ the inequality

$$\rho(S_n, N) \le D(\varepsilon, n)\varepsilon_n$$

holds. We set

$$D(\varepsilon) := \sup_{n} D(\varepsilon, n).$$

Then the constant $C$ can be determined as

$$C = \sup_{\varepsilon > 0} D(\varepsilon).$$

Hence, it suffices to show that for all possible values of $\varepsilon$ and $n$ the quantity $D(\varepsilon, n) \le 0.5606$ (and in the case of i.i.d. r.v. $D(\varepsilon, n) \le 0.4785$). For $\varepsilon \ge 1/0.5606$ (respectively, $\varepsilon \ge 1/0.4785$) the latter is obvious, since $\rho(S_n, N) \le 1$.

Moreover, denote A„ := a^{n)/{a^{n) — max af) and set 

k=l,...,n 



k=i 



Then, according to the inequality (1.52) from [21], for + e'^ ^ 0.2 we have 

p(S'„, A^) ^ 0.27283e„ + 0.19948^ + 0.09116e;; + 0.00095(e„ + e'^f. 



(43) 



Assume without loss of generality that a = 1. Then the Lyapunov inequality yields 
c^j ^ Pk ^ Ylk=i /^fc = ^n, j = 1, • • • , n. Hence ^ (1 - en '^)~'^. In addition, ^ e„ and, 
as it was shown in [3T], e'^ ^ {e'^)^^^. From these inequalities and fH3|l it follows easily that 
D{e) ^ 0.5606 when e ^ 0.02. 

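In the case of i.i.d. summands $\varepsilon_n = \beta_1/(\sigma_1^3\sqrt{n}) \ge 1/\sqrt{n}$. Thus, $n \ge \lceil 1/\varepsilon_n^2 \rceil$ and

$$\lambda_n = \frac{n}{n - 1} \le \frac{n_0(\varepsilon_n)}{n_0(\varepsilon_n) - 1}, \tag{44}$$

where $n_0(\varepsilon) := \lceil 1/\varepsilon^2 \rceil$. Moreover,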



An 



3/2 



and e!L = — . 

n 



(45) 



Combining (43), (44) and (45) yields $D(\varepsilon) \le 0.4785$ when $\varepsilon \le 0.037$. Therefore, in the general case we have to consider $\varepsilon$ from the segment $I_1 = [0.02; 1/0.5606]$ and in the case of i.i.d. r.v. from $I_2 = [0.037; 1/0.4785]$. The proof for these values of $\varepsilon$ is based on an inequality due to Prawitz



p{Sn,N) ^ 1 



( Uo 



1 

U 



K 



\Uo 



+ 



• \5n{u)\du + 









/ V 


K 











Uo<\u\igj 



Uo 



-Uo 



where K{u) := |(1 



|m|) + |((1 



u \u 



\u\) cot{nu) 



2tiu 



\({>{u)\du + 



l/n( 


u)\du+ 


/ 




du 




2nu 





\ 



sgnju) 



\u\>Uo 

0<Uo^U. 



, (46) 
It follows from (46) that $D(\varepsilon)$ does not exceed the quantity $D^*(\varepsilon, U_0, U)$, which arises on the right-hand side of (46) when we replace $\delta_n(t)$ with its estimate $\min\{\delta_1(\varepsilon, t), \delta_2(\varepsilon, t)\}$ and $|f_n(t)|$ with the estimate $f_1(\varepsilon, t)$, and select parameters $U_0 \le U$ such that the resulting expression is as small as possible. This procedure was carried out with the aid of a computer for several hundred values of $\varepsilon$ dispersed on the segment $I_1$. To obtain the estimates for the intermediate points we used the following property of the quantities $D^*$, which holds due to the monotonicity of the functions $f_1, \ldots, f_3$ and $\delta_1, \ldots, \delta_4$ with respect to their first arguments:

$$D^*(\varepsilon^{(1)}, U_0, U) \le \frac{\varepsilon^{(2)}}{\varepsilon^{(1)}} D^*(\varepsilon^{(2)}, U_0, U), \quad \varepsilon^{(1)} < \varepsilon^{(2)}. \tag{47}$$

The extremal value of the quantity $D^*(\varepsilon, U_0, U) = 0.56054$ was attained for $\varepsilon = 0.5092$, $U_0 = 2.4852$, $U = 5.9508$.

In the case of i.i.d. r.v. the estimates were constructed in a different way.

For a fixed value of $\varepsilon$ we estimated the quantities $D(\varepsilon, n)$, $n \ge 1$, separately. For $n < m$, where $m$ is some natural number, individual estimates of $D(\varepsilon, n)$ were given. On the right-hand side of (46) we replaced $\delta_n(t)$ and $|f_n(t)|$ with their upper estimates $\delta_3(\varepsilon, n, t)$ and $f_2(\varepsilon, n, t)$. After that the computational procedure described above was carried out to select the optimal parameters $U$ and $U_0$. For $n \ge m$ the quantities $D(\varepsilon, n)$ were estimated uniformly. On the right-hand side of (46) the estimates $\delta_4(\varepsilon, m, t)$ and $f_3(\varepsilon, m, t)$ were used. As before, it was sufficient to carry out the calculations only for a finite number of points, since a property similar to (47) holds in this case as well. For the i.i.d. r.v. the extremal value 0.47849 was attained for $\varepsilon = 0.3536$, $n = 8$, $U_0 = 2.6157$, $U = 8.9115$.

Thus, the constant $C$ does not exceed 0.5606. And if we restrict ourselves to the case of i.i.d. r.v., we have $C \le 0.4785$.